Tracking Provenance in ORNL’s Flexible Research Platforms
Abstract
Provenance is information about the origin of objects, a concept that applies to both physical and digital objects and often spans the two. In systems designed for research, provenance is an important but often overlooked feature. It allows exact tracking of information, its use, its lineage, its derivations, and other metadata needed to adhere to the scientific method. In our project, provenance lets researchers determine detailed information about how sensor data is used in their experiments on ORNL’s Flexible Research Platforms (FRPs). Our provenance system, the Provenance Data Management System (ProvDMS), tracks information from the moment an FRP sensor creates it, recording station, sensor, and sensor channel information. The system lets researchers derive generations of experiments from the sensor data and tracks their hierarchical flow, so that key points in the history of the information are visible as part of its workflow. The use of provenance in science is relatively new, and although it has been applied elsewhere, our approach differs in one key respect. Most systems must be designed or redesigned around a provenance system in order to track provenance. Ours is designed as a cohesive but separate entity, so researchers can continue using their own methods of analysis without being constrained in order to track provenance. We built ProvDMS on a lightweight provenance library, the Core Provenance Library (CPL). In addition to tracking sensor-data experiments and their provenance, ProvDMS provides a web-enabled visualization of the inheritance.
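The hierarchical flow described in the abstract (station, sensor, channel, then generations of derived experiments) can be illustrated with a minimal sketch. This is not the ProvDMS implementation and does not use the CPL API; the class ProvNode, its methods, and the names "FRP-1", "temp-sensor", "ch0", "baseline", and "filtered" are all hypothetical, chosen only to show how a lineage could be recorded and read back.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Optional


@dataclass(eq=False)
class ProvNode:
    """One object in a provenance tree (hypothetical; not the CPL API)."""
    name: str
    kind: str  # assumed kinds: "station", "sensor", "channel", "experiment"
    parent: Optional[ProvNode] = field(default=None, repr=False)
    children: list[ProvNode] = field(default_factory=list, repr=False)

    def derive(self, name: str, kind: str) -> ProvNode:
        """Create a child object and record that it was derived from this one."""
        child = ProvNode(name, kind, parent=self)
        self.children.append(child)
        return child

    def lineage(self) -> list[str]:
        """Return the ancestry of this object, from the root down to itself."""
        chain = []
        node: Optional[ProvNode] = self
        while node is not None:
            chain.append(f"{node.kind}:{node.name}")
            node = node.parent
        return list(reversed(chain))


# Hypothetical example: one sensor channel and two generations of experiments.
station = ProvNode("FRP-1", "station")
sensor = station.derive("temp-sensor", "sensor")
channel = sensor.derive("ch0", "channel")
baseline = channel.derive("baseline", "experiment")
filtered = baseline.derive("filtered", "experiment")

print(" -> ".join(filtered.lineage()))
```

Keeping the tree in its own small structure, separate from whatever analysis code produces the experiments, mirrors the abstract's point that provenance tracking can sit alongside existing methods of analysis rather than forcing them to be redesigned.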
Related resources
Domain-specific summarization of Life-Science e-experiments from provenance traces
Translational research in Life-Science nowadays leverages e-Science platforms to analyse and produce huge amounts of data. With the unprecedented growth of Life-Science data repositories, identifying relevant data for analysis becomes increasingly difficult. The instrumentation of e-Science platforms with provenance tracking techniques provides useful information from a data analysis process des...
Multi-Scale Science: Supporting Emerging Practice with Semantically Derived Provenance
Scientific progress is becoming increasingly dependent on our ability to study phenomena at multiple scales and from multiple perspectives. The ability to recontextualize third-party data within the semantic and syntactic framework of a given research project is increasingly seen as a primary barrier in multi-scale science. Within the Collaboratory for Multi-Scale Chemical Science (CMCS) project...
Rosemary: A Flexible Programming Framework to Build Science Gateways
The lessons learned during six years of experience in design, development, and operation of four Science Gateway (SG) generations motivated us to develop yet another generation of platforms, named “Rosemary”. At the core of Rosemary, the three fundamental SG functions, namely those related to data, computing, and collaboration management, are integrated together. Our earlier studies showed that comple...
Oceanographic Data Provenance Tracking with the Shore Side Data System
The importance of tracking the provenance of electronic data becomes apparent when data set providers need to also provide metadata describing where the data came from. This need has driven the development of a practical oceanographic data provenance system at the Monterey Bay Aquarium Research Institute. MBARI’s Shore Side Data System is designed to manage data collected, processed, and archiv...
Publication date: 2013